Tesseract OCR: A Case Study for License Plate Recognition in Brazil
Abstract
This paper presents an analysis of Google's Tesseract OCR for license plate recognition in Brazil. Tesseract's performance is compared with that of two market-grade OCR products, referred to here as "A" and "B"; the anonymization is required by a confidentiality agreement with the company supporting this research. The use of OpenCV is also considered, owing to limitations inherent to Tesseract OCR.
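The abstract does not say which metrics are used to compare Tesseract with products "A" and "B". As a minimal sketch, assuming the comparison is made on exact plate matches and on character-level errors, the scoring could look like the following. The function names (`edit_distance`, `normalize`, `score_engine`) and the sample plates are hypothetical and do not come from the paper.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between two strings, using a rolling row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def normalize(plate: str) -> str:
    """Uppercase the reading and drop separators such as '-' or spaces."""
    return "".join(ch for ch in plate.upper() if ch.isalnum())


def score_engine(predictions, truths):
    """Return plate-level accuracy and character error rate for one engine.

    Assumption: both lists are non-empty, aligned, and of equal length.
    """
    exact = 0
    char_errors = 0
    total_chars = 0
    for pred, truth in zip(predictions, truths):
        p, t = normalize(pred), normalize(truth)
        exact += int(p == t)
        char_errors += edit_distance(p, t)
        total_chars += len(t)
    return {
        "plate_accuracy": exact / len(truths),
        "char_error_rate": char_errors / total_chars,
    }


# Made-up ground truth and OCR readings, for illustration only.
truths = ["ABC-1234", "XYZ-9876"]
readings = ["ABC1234", "XYZ-9B76"]
result = score_engine(readings, truths)
```

Running `score_engine` once per engine on the same ground-truth list would give directly comparable figures for Tesseract, "A", and "B".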
Similar resources
Optical Character Recognition by Open source OCR Tool Tesseract: A Case Study
Optical character recognition (OCR) is used to convert printed text into editable text. OCR is a useful and popular method in various applications. Its accuracy can depend on text preprocessing and segmentation algorithms. Sometimes it is difficult to retrieve text from an image because of differing size, style, orientation, complex image background, etc. We begin...
Recognition of Handwritten Roman Script Using Tesseract Open source OCR Engine
In the present work, we have used Tesseract 2.01 open source Optical Character Recognition (OCR) Engine under Apache License 2.0 for recognition of handwriting samples of lower case Roman script. Handwritten isolated and free-flow text samples were collected from multiple users. Tesseract is trained to recognize user-specific handwriting samples of both the categories of document pages. On a si...
Recognition of Handwritten Textual Annotations using Tesseract Open Source OCR Engine for information Just In Time (iJIT)
The objective of the current work is to develop an Optical Character Recognition (OCR) engine for an information Just In Time (iJIT) system that can be used to recognize handwritten textual annotations in lower case Roman script. The Tesseract open source OCR engine, under Apache License 2.0, is used to develop user-specific handwriting recognition models, viz., the language sets, for the said system,...
Lucrative Method for License Plate Recognition
Recent research initiatives have addressed the need for improved license plate recognition accuracy, which would benefit many applications, ITS in particular. Different image processing techniques have been implemented for this purpose, specifically edge detection, binarization, a segmentation algorithm, and Tesseract. Each of these steps has its own strengths and weaknesses and it has ...
Comparison of Visual and Logical Character Segmentation in Tesseract OCR Language Data for Indic Writing Scripts
Language data for the Tesseract OCR system currently supports recognition of a number of languages written in Indic writing scripts. An initial study is described to create comparable data for Tesseract training and evaluation based on two approaches to character segmentation of Indic scripts; logical vs. visual. Results indicate further investigation of visual based character segmentation lang...